Abstract: Recently we have described the discovery and complete genome sequence of a novel coronavirus
associated with pneumonia, coronavirus HKU1 (CoV-HKU1). In this study, a detailed in silico analysis of
ORF1ab, which encodes the 7,182-amino acid replicase polyprotein in the CoV-HKU1 genome, showed that
the replicase polyprotein of CoV-HKU1 is cleaved by its papain-like proteases and 3C-like protease
(3CLpro) into 16 polypeptides homologous to the corresponding polypeptides in other coronaviruses.
Surprisingly, analysis of the putative cleavage sites of the 3CLpro revealed a unique putative cleavage site. In all
known coronaviruses, the P1 positions at the cleavage sites of the 3CLpro are occupied by glutamine. This is
also observed in CoV-HKU1, except for one site at the junction between nsp10 (helicase) and nsp11 (member
of exonuclease family), where the P1 position is occupied by histidine. This amino acid substitution is
due to a single nucleotide mutation in the CoV-HKU1 genome, CAG/A to CAT. This probably represents a
novel cleavage site because the same mutation was consistently observed in CoV-HKU1 sequences from
multiple specimens of different patients; the P2 and P1'-P12' positions of this cleavage site are consistent
between CoV-HKU1 and other coronaviruses; and as the helicase is one of the most conserved proteins in
coronaviruses, cleavage between nsp10 and nsp11 should be an essential step for the generation of the
mature functional helicase. Experiments, including purification and C-terminal amino acid sequencing of
the CoV-HKU1 helicase and trans-cleavage assays of the CoV-HKU1 3CLpro, will confirm the presence of
this novel cleavage site.

Coronaviruses, a genus of the family Coronaviridae, are large, enveloped, positive-stranded RNA
viruses that cause respiratory, enteric, hepatic, and neurological diseases in humans and domestic
animals. In humans, it has been estimated that coronaviruses [human coronaviruses 229E (HCoV-229E)
and OC43 (HCoV-OC43)] are responsible for about 5-30% of respiratory tract infections. In late 2002
and 2003, a novel clinical syndrome, Severe Acute Respiratory Syndrome (SARS), affected 30 countries
on five continents with more than 8,000 cases and 750 deaths. A novel coronavirus, the SARS
coronavirus (SARS-CoV), has been confirmed to be the aetiological agent, and its genome has been
completely sequenced (13, 28, 32, 36, 37, 39, 51-53). In early 2004, a novel human coronavirus
associated with upper respiratory tract infections, human coronavirus NL63 (HCoV-NL63), was also
discovered (11, 47). Phenotypically, the envelopes of all coronaviruses are studded with long,
petal-shaped spikes, resulting in the appearance of a crown under the electron microscope.
Genotypically, coronaviruses possess the largest genomes of all RNA viruses, about 30 kb. As a
result of a unique mechanism of viral replication, coronaviruses have a high frequency of
recombination (24, 25, 29, 31). The order of the genes in the genomes of all coronaviruses that
encode the RNA-dependent RNA polymerase (pol) and the four structural proteins present in all
coronaviruses is 5'-pol, spike (S), envelope (E), membrane (M), nucleocapsid (N)-3'. Based on
serological and genotypic characterization, coronaviruses were divided into three distinct groups,
with HCoV-229E and HCoV-NL63 being group 1 coronaviruses and HCoV-OC43 a group 2 coronavirus
(11, 26, 47).

Abbreviations: Appr>p, ADP-ribose 1",2"-cyclic phosphate; Appr-1"p, ADP-ribose 1"-phosphate; ADRP,
ADP-ribose 1"-phosphatase; BCoV, bovine coronavirus; 3CLpro, 3C-like protease; CoV-HKU1,
coronavirus HKU1; CPDase, cyclic nucleotide phosphodiesterase; E, envelope; ExoN, 3'-to-5'
exonuclease; HCoV-229E, human coronavirus 229E; HCoV-NL63, human coronavirus NL63; HCoV-OC43,
human coronavirus OC43; IBV, infectious bronchitis virus; M, membrane; MHV, murine hepatitis virus;
N, nucleocapsid; 2'-O-MT, S-adenosylmethionine-dependent ribose 2'-O-methyltransferase; ORF, open
reading frame; PLpro, papain-like proteases; pol, RNA-dependent RNA polymerase; RT-PCR, reverse
transcriptase polymerase chain reaction; S, spike; SARS, Severe Acute Respiratory Syndrome;
SARS-CoV, SARS coronavirus; XendoU, poly(U)-specific endoribonuclease.

Recently, we have described the discovery of a novel group 2 coronavirus associated with
pneumonia, coronavirus HKU1 (CoV-HKU1), and its complete genome sequence (48-50). In the genome
of CoV-HKU1, more than two thirds of the genome is made up of a single open reading frame (ORF),
ORF1ab, which encodes the putative replicase polyprotein. In other coronaviruses, this
polyprotein, after cleavage by proteases encoded by this ORF, gives rise to more than 10 proteins,
many of which are important enzymes essential for the survival of the coronaviruses. In this
article, we describe a detailed in silico analysis of this ORF in CoV-HKU1, which reveals a unique
putative cleavage site of the 3C-like protease (3CLpro) in coronavirus HKU1. Other similarities
and differences of this ORF in CoV-HKU1 as compared to other coronaviruses are also discussed.

Materials and Methods 

Viral sequences. The predicted amino acid sequences of ORF1ab in CoV-HKU1 were extracted from the
CoV-HKU1 genome sequence (GenBank accession no. NC_006577) (48). The corresponding amino acid
sequences of ORF1ab in other coronaviruses were extracted from GenBank (HCoV-OC43, GenBank
accession no. NP_937947; BCoV, GenBank accession no. NP_150073; MHV A59, GenBank accession no.
P16342; HCoV-229E, GenBank accession no. NP_073549; IBV, GenBank accession no. NP_066134; and
SARS-CoV, GenBank accession no. NP_828849). The amino acid sequences of the RNA-dependent RNA
polymerases of hepatitis C virus, rabbit hemorrhagic disease virus and poliovirus 1 and their
secondary structures were retrieved from the Protein Data Bank (PDB ID 1QUV, 1KHV and 1RDR
respectively).

In silico analysis. Multiple alignment was performed using ClustalX 1.83 (46). Protein family
analysis was performed using PFAM (2). Secondary structure prediction was performed using PROFsec
(38). Three-dimensional modeling of the 3CLpro of CoV-HKU1 was performed using the 3CLpro of
SARS-CoV as the template (54). The manually corrected sequence alignment and homology modeling
request were submitted via DeepView-spdbv 3.7 (14) to the Swiss-Model protein modeling web server
(http://swissmodel.expasy.org/) (40). The 3D structure was displayed using MOLMOL 2K.2 (22).
Phylogenetic tree construction was performed using the neighbour joining method with ClustalX
1.83.

Results and Discussion 

General Overview 

The replicase polyprotein (7,182 amino acids) of CoV-HKU1 is translated from ORF1a (13,395 nt)
and ORF1b (8,154 nt). Similar to other coronaviruses, a slippery sequence (UUUAAAC), followed by
sequences that form a putative pseudoknot structure, is present (Fig. 1) (5, 6). Translation will
presumably occur by a -1 RNA-mediated ribosomal frameshift at the end of the slippery sequence.
Therefore, instead of reading the transcript as UUUAAACGGG, it will be read as UUUAAACCGGG. In
infectious bronchitis virus (IBV) and SARS-CoV, site directed mutagenesis experiments have
confirmed the importance of the slippery sequence as well as the pseudoknot structure for the
frameshift to occur (7, 45).
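
The re-reading of the last nucleotide of the slippery sequence described above can be illustrated
with a minimal sketch. This is a toy illustration only: the function name
`read_with_minus_one_frameshift` is hypothetical, and the code models nothing beyond stepping back
one nucleotide at the end of UUUAAAC (it ignores the pseudoknot and frameshift efficiency).

```python
SLIPPERY = "UUUAAAC"  # slippery sequence reported for CoV-HKU1


def read_with_minus_one_frameshift(rna: str, slippery: str = SLIPPERY) -> str:
    """Return the nucleotides as effectively read after a -1 frameshift.

    Hypothetical helper: the ribosome is modelled as stepping back one
    nucleotide at the end of the slippery sequence, so the last base of
    that sequence is read twice.
    """
    start = rna.find(slippery)
    if start == -1:
        raise ValueError("slippery sequence not found")
    end = start + len(slippery)
    # Everything up to the end of the slippery sequence is read normally;
    # reading then resumes one nucleotide upstream (index end - 1).
    return rna[:end] + rna[end - 1:]


print(read_with_minus_one_frameshift("UUUAAACGGG"))  # UUUAAACCGGG
```

The printed string matches the read-through given in the text, in which the final C of the
slippery sequence appears twice.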

Proteolytic Cleavage of Replicase Polyprotein 

Multiple alignments between the replicase polyprotein of CoV-HKU1 and those of other
coronaviruses reveal the orthologs of two types of proteases, papain-like proteases (PLpro) and
3CLpro. As in other coronaviruses, multiple alignment and analysis of sequences around consensus
cleavage sites indicate that the replicase polyprotein of CoV-HKU1 is cleaved by its PLpro and
3CLpro into 16 polypeptides (Table 1). The three N-terminal cleavage sites are putatively cleaved
by the PLpro, whereas the others are putatively cleaved by the 3CLpro.

The putative PLpro cleavage sites in the replicase polyprotein of CoV-HKU1 follow the general
rules determined by site directed mutagenesis experiments in other group 2 coronaviruses. In MHV,
site directed mutagenesis studies have shown that at the p28/p65 cleavage site, a basic residue
(arginine or lysine) in the P5 position, arginine in the P2 position, and glycine in the P1
position are the critical amino acids for PL1pro recognition and processing (17). In CoV-HKU1,
arginine, arginine and glycine are present in these three positions respectively (Table 2). In
MHV, site directed mutagenesis studies have shown that at the p65/nsp1 cleavage site, arginine in
the P5 position, alanine in the P1 position, and glycine in the P1' position are the critical
amino acids for PL1pro recognition and processing (4). In CoV-HKU1, arginine, alanine and glycine
are present in these three positions respectively (Table 2). In MHV, site directed mutagenesis
studies have shown that at the nsp1/p44 cleavage site, phenylalanine in the P6 position, glycine
in the P2 position, and glycine in the P1 position are the critical amino acids for PL2pro
recognition and processing (20). In CoV-HKU1, phenylalanine, glycine and glycine are present in
these three positions respectively (Table 2).

In contrast to the putative PLpro cleavage sites, analysis of the putative cleavage sites of the
3CLpro in the replicase polyprotein of CoV-HKU1 revealed a unique putative cleavage site. The
3CLpro of coronaviruses are named as such because their structures and substrate specificities
resemble those of the 3C proteases in picornaviruses. In all known coronaviruses, the P1 positions
at the cleavage sites of the 3CLpro are occupied by glutamine, whereas the P1' positions are
occupied by aliphatic amino acids, including alanine, glycine, cysteine, asparagine and serine (3, 9,


Table 1. Characteristics of the 16 putative polypeptides derived from replicase polyprotein in CoV-HKU1

Polypeptides  First amino acid-    Number of amino  Characteristics
              last amino acid      acid residues
p28           Met1-Gly222          222              Unknown
p65           Ile223-Ala809        587              Unknown
nsp1          Gly810-Gly2838       2,029            ATR, AC, ADRP, PL1pro, PL2pro, Y domain
p44           Val2839-Gln3334      496              Hydrophobic domain
nsp2          Ser3335-Gln3637      303              3CLpro
nsp3          Ser3638-Gln3924      287              Hydrophobic domain
nsp4          Ser3925-Gln4013      92               Unknown
nsp5          Ala4014-Gln4210      194              Unknown
nsp6          Asn4211-Gln4320      110              Unknown
nsp7          Ala4321-Gln4457      137              Unknown
nsp8          Ser4458-Val4471      14               Unknown (short peptide at the end of ORF1a)
nsp9          Ser4458-Gln5385      928              Pol
nsp10         Ser5386-His5988      603              Hel
nsp11         Cys5989-Gln6509      521              ExoN
nsp12         Ser6510-Gln6883      374              XendoU
nsp13         Ala6884-Cys7182      299              2'-O-MT

ATR, acidic tandem repeats; AC, acidic domain; ADRP, ADP-ribose 1"-phosphatase; PL1pro, papain-like
protease 1; PL2pro, papain-like protease 2; 3CLpro, 3C-like protease; Pol, RNA-dependent RNA polymerase;
Hel, helicase; ExoN, 3'-to-5' exonuclease; XendoU, poly(U)-specific endoribonuclease and 2'-O-MT,
S-adenosylmethionine-dependent 2'-O-ribose methyltransferase.

Fig. 1. Model of ribosomal frameshift element in CoV-HKU1. It
consists of a slippery sequence (UUUAAAC) and a pseudoknot
structure made up of two stems and two loops.

Table 2. Analysis of amino acid residues around putative cleavage sites of PLpro and 3CLpro in replicase polyprotein of CoV-HKU1

                          Amino acid residues around putative cleavage
                          sites in CoV-HKU1                                 MHV, BCoV,
Putative      Positions                                                     HCoV-OC43   Putatively
polypeptides  (P1|P1')    P6  P5  P4  P3  P2  P1  P1' P2' P3' P4' P5' P6'   (P1|P1')    cleaved by
p28|p65       222|223     L   R   Q   Y   R   G   I   K   P   V   L   F     G|V         PL1pro
p65|nsp1      809|810     W   R   F   P   C   A   G   R   K   V   N   F     -a)         PL1pro
nsp1|p44      2838|2839   F   S   L   K   G   G   V   V   L   S   N   L     G|A         PL2pro
p44|nsp2      3334|3335   S   T   S   F   L   Q   S   G   I   V   K   M     -           3CLpro
nsp2|nsp3     3637|3638   A   G   V   K   L   Q   S   K   T   K   R   F     -           3CLpro
nsp3|nsp4     3924|3925   E   V   S   Q   I   Q   S   K   L   T   D   V     -           3CLpro
nsp4|nsp5     4013|4014   D   S   T   V   L   Q   A   L   Q   S   E   F     -           3CLpro
nsp5|nsp6     4210|4211   A   N   A   V   M   Q   N   N   E   L   M   P     -           3CLpro
nsp6|nsp7     4320|4321   S   T   I   R   L   Q   A   G   V   A   T   E     -           3CLpro
nsp7|nsp9     4457|4458   S   S   V   A   V   Q   S   K   D   L   N   F     -           3CLpro
nsp9|nsp10    5385|5386   K   S   A   V   M   Q   S   V   G   A   C   V     -           3CLpro
nsp10|nsp11   5988|5989   T   L   P   R   L   H   C   T   T   N   L   F     Q|C         3CLpro
nsp11|nsp12   6509|6510   T   F   T   T   L   Q   S   L   E   N   V   I     -           3CLpro
nsp12|nsp13   6883|6884   F   Y   P   K   M   Q   A   T   N   D   W   K     -           3CLpro

a) -, P1|P1' amino acid residues in CoV-HKU1 are the same as those in MHV, BCoV and HCoV-OC43.

The amino acid residues critical for cleavage by PL1pro in MHV as determined by site directed mutagenesis studies
are highlighted.


Fig. 2. Multiple alignment of the amino acid sequences around the putative cleavage site between nsp10 and
nsp11 in CoV-HKU1, MHV, BCoV and HCoV-OC43. The conserved amino acids from positions P2 to P12' in
the four coronaviruses are highlighted.


16, 23). In most cases, the P1' positions are occupied by alanine, glycine or serine.
Occasionally, in group 2 coronaviruses, the P1' positions are occupied by cysteine (30); and in
rhinoviruses, by asparagine (3). In CoV-HKU1, multiple alignment with the genome sequences of
other coronaviruses revealed that all, except one, P1 positions of putative cleavage sites of
3CLpro are occupied by glutamine. The exception lies at the junction between nsp10 (helicase) and
nsp11 (member of exonuclease family), where the P1 position is occupied by histidine. This
probably represents a novel cleavage site for the following reasons. First, the sequence is
genuine instead of a result of RT-PCR or sequencing errors because the same position has been
amplified and sequenced four times in different clinical specimens of the index patient (48).
Second, the same mutation was consistently found in the genome sequence of the CoV-HKU1 in the
second patient with pneumonia (data not shown). Therefore, it is unlikely that it is due to an
occasionally occurring mutation in just one patient. Third, in all other group 2 coronaviruses
with genome sequences available (MHV, BCoV and HCoV-OC43), the P2, P1', P2', P3', P4', P5', P6',
P7', P8', P9', P10', P11' and P12' positions of this cleavage site are occupied by leucine/valine
[shown to be important for 3CLpro cleavage (35)], cysteine, serine/threonine, threonine,
asparagine, leucine, phenylalanine, lysine, aspartic acid, cysteine, serine, lysine/arginine and
serine respectively, which are also present at the corresponding positions in CoV-HKU1 (Fig. 2).
Fourth, this amino acid substitution is probably due to a single nucleotide mutation in the
CoV-HKU1 genome, CAG (as in MHV) or CAA (as in BCoV and HCoV-OC43) (which encodes glutamine) to
CAT (which encodes histidine). Finally, as the helicase is one of the most conserved proteins in
coronaviruses, cleavage between nsp10 and nsp11 should be an essential step for the generation of
the mature functional helicase. The present novel putative cleavage site and some other atypical
3CLpro cleavage sites were not predicted by a program for prediction of 3CLpro cleavage sites
(21). Experiments, including purification of the CoV-HKU1 helicase in its native form with
C-terminal amino acid sequencing, and trans-cleavage assays of the CoV-HKU1 3CLpro, will confirm
the presence of this novel cleavage site.
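
Two observations behind this argument, a single non-glutamine P1 residue and a one-nucleotide
difference between the glutamine and histidine codons, can be checked with a short sketch. The
P2, P1, P1' and P2' residues below are transcribed from Table 2; the dictionary and function names
are illustrative assumptions and are not part of the original analysis.

```python
# P2 P1 | P1' P2' residues at the putative 3CLpro cleavage sites (from Table 2).
SITES_3CLPRO = {
    "p44|nsp2": "LQ|SG",
    "nsp2|nsp3": "LQ|SK",
    "nsp3|nsp4": "IQ|SK",
    "nsp4|nsp5": "LQ|AL",
    "nsp5|nsp6": "MQ|NN",
    "nsp6|nsp7": "LQ|AG",
    "nsp7|nsp9": "VQ|SK",
    "nsp9|nsp10": "MQ|SV",
    "nsp10|nsp11": "LH|CT",
    "nsp11|nsp12": "LQ|SL",
    "nsp12|nsp13": "MQ|AT",
}

# Only the codons relevant to the argument (standard genetic code).
CODON_TO_AA = {"CAG": "Q", "CAA": "Q", "CAT": "H", "CAC": "H"}


def non_glutamine_p1(sites):
    """Return the names of sites whose P1 residue (just before '|') is not Q."""
    return [name for name, window in sites.items()
            if window.split("|")[0][-1] != "Q"]


def nucleotide_differences(codon_a, codon_b):
    """Return the 0-based codon positions at which two codons differ."""
    return [i for i, (a, b) in enumerate(zip(codon_a, codon_b)) if a != b]


print(non_glutamine_p1(SITES_3CLPRO))  # ['nsp10|nsp11']
for gln_codon in ("CAG", "CAA"):
    # Both glutamine codons differ from CAT (His) only at the third position.
    print(gln_codon, CODON_TO_AA[gln_codon], "-> CAT", CODON_TO_AA["CAT"],
          nucleotide_differences(gln_codon, "CAT"))
```

Under these assumptions, the only site flagged is nsp10|nsp11, and either glutamine codon reaches
CAT through a single change at the third codon position.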

Putative Polypeptides of the CoV-HKU1 Replicase Polyprotein

The replicase polyprotein of CoV-HKU1 is putatively cleaved by its PLpro and 3CLpro into 16
polypeptides (Table 1). Among these 16 putative polypeptides, p28, p65, nsp3, nsp4, nsp5, nsp6,
nsp7 and nsp8 have no homologues with known functions by BLAST search.
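
As a simple consistency check, the residue counts listed in Table 1 can be summed and compared
with the 7,182-amino acid length of the replicase polyprotein. The sketch below is illustrative
only; the variable and function names are assumptions. nsp8 is left out of the sum because, in
Table 1, nsp8 and nsp9 both start at Ser4458, so counting both would count the same residues twice.

```python
# (polypeptide, number of amino acid residues) as listed in Table 1.
LENGTHS = [
    ("p28", 222), ("p65", 587), ("nsp1", 2029), ("p44", 496),
    ("nsp2", 303), ("nsp3", 287), ("nsp4", 92), ("nsp5", 194),
    ("nsp6", 110), ("nsp7", 137), ("nsp8", 14), ("nsp9", 928),
    ("nsp10", 603), ("nsp11", 521), ("nsp12", 374), ("nsp13", 299),
]

POLYPROTEIN_LENGTH = 7182  # amino acids, as stated in the text


def total_length(lengths, skip=("nsp8",)):
    """Sum residue counts, skipping polypeptides that overlap another one."""
    return sum(n for name, n in lengths if name not in skip)


print(len(LENGTHS))           # 16
print(total_length(LENGTHS))  # 7182
print(total_length(LENGTHS) == POLYPROTEIN_LENGTH)  # True
```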

Similar to other coronaviruses, the N terminal of the putative nsp1 consists of an acidic domain
(amino acids 1-333 of nsp1). However, unlike other coronaviruses, there are 16 tandem copies of a
10-amino acid repeat near the C terminal of the acidic domain. The first 14 tandem copies are
perfect repeats of NDDENVVTGD, with four of the 10 amino acids being either glutamic acid or
aspartic acid. The last two copies are imperfect repeats of NNDEEIVTGD and NDDQIVVTGD
respectively. In picornaviruses, three imperfect tandem copies of a 24-amino acid repeat, named
VPg1, VPg2 and VPg3, have been found (10). However, no specific functions have been found for
these tandem repeats. Downstream of the acidic domain is the PL1pro (amino acids 334-549 of nsp1),
with the characteristic catalytic dyad of cysteine and a downstream histidine. Moreover, similar
to the PL1pro of other coronaviruses, four conserved cysteine residues (Cys429, Cys432, Cys455 and
Cys457) that may form a Zn2+-binding motif are present. Downstream of PL1pro is the X domain
(amino acids 550-709 of nsp1) that contains putative ADP-ribose 1"-phosphatase (ADRP) activity
[amino acids 575-685 of nsp1 belonging to the Appr-1"-p processing enzyme family (Pfam accession
no. PF01661)]. The putative ADRP activity was first described for this domain by Snijder et al.
(43). In Saccharomyces cerevisiae and other eukaryotes, ADRP and its functionally related enzyme
cyclic nucleotide phosphodiesterase (CPDase) are important for tRNA processing (33). ADP-ribose
1",2"-cyclic phosphate (Appr>p) is produced as a result of tRNA splicing. Appr>p is converted to
ADP-ribose 1"-phosphate (Appr-1"p) by CPDase. Appr-1"p is then further processed by ADRP. In other
group 2 coronaviruses, both putative ADRP and CPDase (in NS2a) homologues are present (43).
Interestingly, in CoV-HKU1 and also SARS-CoV, only ADRP, but not CPDase [NS2a is not present in
CoV-HKU1 (48)], is present. Downstream of the X domain, there is a segment of 134 amino acids,
also present in other coronaviruses, with no matches of known functions found by BLAST search.
Downstream of this segment of unknown function is the putative PL2pro (amino acids 844-1140 of
nsp1), with the characteristic catalytic dyad of cysteine and a downstream histidine. Similar to
the PL2pro of other coronaviruses, four conserved cysteine residues (Cys1026, Cys1028, Cys1060 and
Cys1062) that may form a Zn2+-binding motif are also present. At the C terminal of nsp1, similar
to other coronaviruses, there is a Y domain with two hydrophobic stretches and 11 conserved
Cys/His residues present in the N terminal 889 amino acids (30). Ziebuhr et al. speculated that
this Y domain may be responsible for anchoring nsp1 into membranes and binding cations (55).
Downstream of nsp1, p44 (496 amino acid residues), putatively cleaved from nsp1 by PL2pro, with a
hydrophobic transmembrane domain, is present.
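
The composition of the acidic tandem repeats described above for nsp1 can be verified with a
minimal sketch. The repeat region is rebuilt here from the description in the text (14 perfect
copies followed by the two imperfect copies) rather than copied from the genome record, and all
names are illustrative assumptions.

```python
CONSENSUS = "NDDENVVTGD"  # perfect 10-amino acid repeat unit from the text

# Assumed reconstruction: 14 perfect copies, then the two imperfect copies.
REPEAT_REGION = CONSENSUS * 14 + "NNDEEIVTGD" + "NDDQIVVTGD"


def split_repeats(region, unit_len=10):
    """Cut the region into consecutive units of unit_len residues."""
    return [region[i:i + unit_len] for i in range(0, len(region), unit_len)]


def mismatches(unit, consensus=CONSENSUS):
    """Count positions at which a repeat unit differs from the consensus."""
    return sum(a != b for a, b in zip(unit, consensus))


units = split_repeats(REPEAT_REGION)
acidic = sum(CONSENSUS.count(aa) for aa in "DE")  # Asp + Glu in one unit

print(len(units))                           # 16
print(sum(u == CONSENSUS for u in units))   # 14
print(acidic)                               # 4
print([mismatches(u) for u in units[-2:]])  # [3, 2]
```

The counts agree with the text: 16 copies, 14 of them perfect, four acidic residues per perfect
unit, and the two imperfect copies differing from NDDENVVTGD at three and two positions.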

Three-dimensional structure modeling of the putative 3CLpro (nsp2) of CoV-HKU1 using the
three-dimensional structure of the SARS-CoV 3CLpro as the template revealed a similar structure
for the monomer of the CoV-HKU1 3CLpro as compared to the 3CLpro of SARS-CoV and HCoV-229E
(Fig. 3). Three domains were present, domain I (amino acid residues 8-101), domain II (amino acid
residues 102-184) and domain III (amino acid residues 201-303). Domains I and II consist of
six-stranded antiparallel β barrels and domain III is a globular structure with five α helices.
The catalytic dyad of the protease is located in a cleft between domains I (His41) and II
(Cys165).

As in other coronaviruses, the putative Pol (nsp9) of CoV-HKU1 consists of the N-terminal domain,
the fingers subdomain, the palm subdomain and the thumb subdomain (Fig. 4). The N-terminal domain
(residues 1-371) of the Pol of CoV-HKU1 shares 52-87% amino acid identities with those of other
coronaviruses but <31% amino acid identities with those of other RNA viruses. The function of the
N-terminal domain is unclear. On the other hand, the fingers (motifs F and G), palm (motifs A-E)
and thumb subdomains of the Pol of CoV-HKU1 are homologous to those of other coronaviruses and
some positive-stranded RNA viruses, such as hepatitis C virus (of family Flaviviridae) (1), rabbit
hemorrhagic disease virus (of family Caliciviridae) (34) and poliovirus 1 (of family
Picornaviridae) (15) (Fig. 4). The critical amino acids, aspartic acids in motif A and the XDD
sequence in motif C, which have been demonstrated to be essential for the function of Pol in other
coronaviruses and positive-stranded RNA viruses, are also present in the Pol of CoV-HKU1.

Fig. 3. Three-dimensional structure modeling for monomer of CoV-HKU1 3CLpro. The α helices are
shown in red and β barrels in blue. The N and C termini are labeled N and C respectively. The
catalytic dyad residues (His41 and Cys163) are indicated.

Similar to the helicase of SARS-CoV, the putative helicase (nsp10) of CoV-HKU1, which unwinds
duplex RNA by nucleoside triphosphate hydrolysis activity, possesses six conserved motifs (18)
(Fig. 5). Similar to other coronaviruses, based on the presence of conserved motifs, the putative
helicase of CoV-HKU1 should belong to superfamily 1 of the three superfamilies of RNA helicases
(12, 19). Furthermore, all nine cysteine residues and three histidine residues that are conserved
in the N termini of the helicases in nidoviruses are present in the putative helicase of CoV-HKU1
(Fig. 5). It has been suggested that this cysteine/histidine region constitutes a Zn2+-binding
domain, which controls the activities of the catalytic domain (42). Similar to the helicases of
other nidoviruses and most other helicases of superfamily 1, the helicase of CoV-HKU1 probably
unwinds the RNA duplex in a 5' to 3' fashion (41, 44).

The putative nsp11 possesses a putative 3'-to-5' exonuclease (ExoN) domain of the DEDD
superfamily (56). The putative nsp12 possesses a putative poly(U)-specific endoribonuclease
(XendoU) domain (27). The putative nsp13 possesses a putative S-adenosylmethionine-dependent
ribose 2'-O-methyltransferase (2'-O-MT) domain of the RrmJ family (8). These three putative
enzymes, as well as ADRP and CPDase, are enzymes in RNA processing pathways. ExoN, XendoU and
2'-O-MT are enzymes in a small nucleolar RNA processing and utilization pathway, in contrast to
the pre-tRNA splicing pathway that ADRP and CPDase belong to. At the moment, no experiments have
been performed to confirm the activities of these putative RNA processing enzymes. Further studies
are required to elucidate the exact roles of these putative viral enzymes in the corresponding
viruses.

Fig. 5. Multiple alignment of amino acid sequences of the helicases of CoV-HKU1, MHV, BCoV,
HCoV-OC43, SARS-CoV, HCoV-229E and IBV. The nine cysteine residues and three histidine residues
that are conserved in the N termini of the helicases in nidoviruses are highlighted. The six
conserved motifs are also depicted.